Probabilistic design of optimal sequential decision-making algorithms in learning and control

نویسندگان

چکیده

This survey is focused on certain sequential decision-making problems that involve optimizing over probability functions. We discuss the relevance of these for learning and control. The organized around a framework combines problem formulation set resolution methods. consists an infinite-dimensional optimization problem. methods come from approaches to search optimal solutions in space Through lenses this overarching we revisit popular control algorithms, showing naturally arise suitable variations mixed with different A running example, which make code available, complements survey. Finally, number challenges arising are also outlined.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data mining for decision making in engineering optimal design

Often in modeling the engineering optimization design problems, the value of objective function(s) is not clearly defined in terms of design variables. Instead it is obtained by some numerical analysis such as FE structural analysis, fluid mechanic analysis, and thermodynamic analysis, etc. Yet, the numerical analyses are considerably time consuming to obtain the final value of objective functi...

متن کامل

Structure Learning in Human Sequential Decision-Making

Studies of sequential decision-making in humans frequently find suboptimal performance relative to an ideal actor that has perfect knowledge of the model of how rewards and events are generated in the environment. Rather than being suboptimal, we argue that the learning problem humans face is more complex, in that it also involves learning the structure of reward generation in the environment. ...

متن کامل

Scalable Algorithms for Multiagent Sequential Decision Making

Introduction In artificial intelligence, decision theory deals with computing a sequence of actions (policy) that an autonomous agent must take in order to optimize its rewards (obtain its goals in the most efficient manner). In many real world situation , an autonomous agent must deal with various sources of uncertainty while computing its optimal policy. In single agent settings, such decisio...

متن کامل

data mining for decision making in engineering optimal design

often in modeling the engineering optimization design problems, the value of objective function(s) is not clearly defined in terms of design variables. instead it is obtained by some numerical analysis such as fe structural analysis, fluid mechanic analysis, and thermodynamic analysis, etc. yet, the numerical analyses are considerably time consuming to obtain the final value of objective functi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Annual Reviews in Control

سال: 2022

ISSN: ['1872-9088', '1367-5788']

DOI: https://doi.org/10.1016/j.arcontrol.2022.09.003